Add code execution harness with tool-use support by tuhinkanti · Pull Request #2 · tuhinkanti/openclaw-java

tuhinkanti · 2026-02-18T05:44:43Z

New ai.openclaw.tool package: Tool interface, ToolResult, CodeExecutionTool
CodeExecutionTool runs shell commands via ProcessBuilder with timeout
LlmResponse structured response type for Anthropic content blocks
LlmProvider.completeWithTools() for tool-use API integration
AnthropicProvider supports full Anthropic tool-use protocol
AgentExecutor agentic loop: call LLM -> execute tools -> loop (max 10)
Message extended with toolUseId, toolError, contentBlocks fields
System prompt updated with code execution instructions
5 new unit tests for CodeExecutionTool

- New ai.openclaw.tool package: Tool interface, ToolResult, CodeExecutionTool - CodeExecutionTool runs shell commands via ProcessBuilder with timeout - LlmResponse structured response type for Anthropic content blocks - LlmProvider.completeWithTools() for tool-use API integration - AnthropicProvider supports full Anthropic tool-use protocol - AgentExecutor agentic loop: call LLM -> execute tools -> loop (max 10) - Message extended with toolUseId, toolError, contentBlocks fields - System prompt updated with code execution instructions - 5 new unit tests for CodeExecutionTool

New tools: - FileReadTool: reads files, lists directories - FileWriteTool: writes/creates files with auto-created dirs - WebSearchTool: fetches URLs, strips HTML to text Infrastructure: - Dockerfile: multi-stage build (Gradle 8 + JDK 21 → Alpine JRE) - .github/workflows/ci.yml: build+test + Docker build verification - .dockerignore: excludes build artifacts from Docker context Tests: - FileReadToolTest: 4 tests (read, missing, directory, metadata) - FileWriteToolTest: 4 tests (create, nested dirs, overwrite, metadata) - WebSearchToolTest: 4 tests (strip HTML, scripts, invalid URL, metadata)

…ol message persistence - AnthropicProvider: merge consecutive tool_result messages into a single user message with multiple content blocks (Anthropic API requirement) - FileReadTool: close Files.list() stream with try-with-resources to prevent file descriptor leak on directory listings - GatewayE2ETest: use random available port instead of hardcoded 18790 to prevent BindException when tests run in quick succession - AgentExecutor: persist intermediate tool messages (assistant_tool_use and tool_result) via sessionStore.appendMessage() so they survive process restarts and session replay is complete

Docker: - Run as non-root 'openclaw' user with dedicated /home/openclaw/workspace - Install bash/curl for tool execution Tool confinement: - CodeExecutionTool: default working directory changed to ~/workspace - FileReadTool: workspace-confined with path normalization validation, rejects paths outside workspace root (../escape and absolute paths) - FileWriteTool: same workspace confinement as FileReadTool - FileReadTool: large file read now uses bounded BufferedReader instead of Files.readString() to prevent OOM on multi-GB files Tests (22 total): - FileReadToolTest: 7 tests (read, relative, missing, dir, path escape, absolute outside, metadata) - FileWriteToolTest: 6 tests (create, dirs, overwrite, path escape, absolute outside, metadata) - CodeExecutionToolTest: uses explicit temp dir for working directory

- CodeExecutionTool: replace StringBuilder with StringBuffer for the shared output buffer accessed by both main and reader threads - Use untimed readerThread.join() in non-timeout path to guarantee the reader finishes before accessing the buffer

- 11 blocked patterns: rm -rf /, mkfs, dd to devices, curl|sh, shutdown, reboot, chmod 777 /, chown /, kill -9 1, overwrite /etc/ - 10 warned patterns: rm, mv, chmod, chown, curl, wget, sudo, pip/npm/apt install - Blocked commands return error immediately without execution - Warned commands log at WARN level before executing - Both pattern lists configurable via constructor - 8 new unit tests covering blocked/safe command detection

The regex now matches -r anywhere in a combined flag group, not just as a separate flag. Catches rm -fr /, rm -fir /, rm -fr * etc. Added regression tests for these bypass variants.

- Block absolute path reads via cat/head/tail/less/more/vi/vim/nano to paths outside /home/*/workspace (negative lookahead) - Block SSRF: curl/wget to 169.254.x.x (cloud metadata), 127.0.0.1, localhost, [::1], 10.x.x.x, 172.16-31.x.x, 192.168.x.x - Block symlink creation (ln -s) to prevent workspace escape via symlinks that bypass FileReadTool/FileWriteTool path validation - 4 new tests: absolute path read, workspace read allowed, SSRF, symlink

- Validate URL before making request: resolve hostname to IP(s) and reject loopback, site-local (private), link-local (cloud metadata), any-local, and multicast addresses via InetAddress built-in checks - Block non-http/https schemes (file://, ftp://, etc.) - Disable redirect-following to prevent redirect-based bypass - 6 new tests: loopback, localhost-by-name, cloud metadata (169.254.x.x), private ranges, non-HTTP schemes, public URL allowed

Track start time before waitFor(), compute remaining timeout budget after process exits, and use that as the join timeout (+2s grace). If the reader thread is still alive after the budget (background child inherited stdout), interrupt it and log a warning. This bounds total wall-clock time to at most timeoutSeconds + 2s, preventing indefinite hangs from commands like 'nohup daemon &'.

devin-ai-integration

Devin Review found 2 new potential issues.

View 14 additional findings in Devin Review.

devin-ai-integration · 2026-02-18T06:52:20Z

src/main/java/ai/openclaw/tool/CodeExecutionTool.java

+            Pattern.compile("\\bcurl\\s+.*\\|\\s*sh"),
+            Pattern.compile("\\bwget\\s+.*\\|\\s*sh"),


devin-ai-integration · 2026-02-18T06:52:22Z

src/main/java/ai/openclaw/tool/CodeExecutionTool.java

+            Pattern.compile("\\b(shutdown|reboot|halt|poweroff)\\b"),
+            Pattern.compile("\\bkill\\s+-9\\s+1\\b"),
+            // Absolute path reads outside workspace (cat, head, tail, less, more, vi, nano)
+            Pattern.compile("\\b(cat|head|tail|less|more|vi|nano|vim)\\s+/(?!home/[^/]+/workspace)"),


🔴 Absolute-path-read blocklist bypassed via ../ traversal after workspace prefix

The regex for blocking reads of absolute paths outside the workspace can be bypassed using path traversal sequences like cat /home/openclaw/workspace/../../../etc/passwd.

Root Cause and Impact

The pattern at src/main/java/ai/openclaw/tool/CodeExecutionTool.java:50 is:

Pattern.compile("\\b(cat|head|tail|less|more|vi|nano|vim)\\s+/(?!home/[^/]+/workspace)")

The negative lookahead (?!home/[^/]+/workspace) only checks whether the characters immediately following / match home/<user>/workspace. So a path like /home/openclaw/workspace/../../../etc/passwd passes the lookahead (since it starts with home/openclaw/workspace), but the shell resolves ../ segments and ultimately reads /etc/passwd.

Verified behavior:

cat /etc/passwd → blocked ✓

cat /home/openclaw/workspace/file.txt → allowed ✓

cat /home/openclaw/workspace/../../../etc/passwd → allowed ✗ (should be blocked)

Impact: An attacker (or LLM-crafted command) can read arbitrary files outside the workspace by prefixing the path with the allowed workspace directory and then using ../ to traverse out.

Prompt for agents

In src/main/java/ai/openclaw/tool/CodeExecutionTool.java line 50, the regex negative-lookahead approach cannot reliably prevent path traversal via ../ sequences. Consider adding a secondary check: after regex matching, also block commands where the argument path contains ".." segments. One approach is to add another blocked pattern like Pattern.compile("\b(cat|head|tail|less|more|vi|nano|vim)\s+\S*\.\.") to catch any traversal attempts. Alternatively, resolve the path argument to its canonical form before checking whether it falls within the workspace, though this is harder to do purely via regex on shell commands.

Was this helpful? React with 👍 or 👎 to provide feedback.

This comment was marked as resolved.

Sign in to view

tuhinkanti added 2 commits February 17, 2026 22:22

This comment was marked as resolved.

Sign in to view

tuhinkanti added 2 commits February 17, 2026 22:36

Fix rm blocklist regex to catch flag reordering (-fr, -fir)

7472953

The regex now matches -r anywhere in a combined flag group, not just as a separate flag. Catches rm -fr /, rm -fir /, rm -fr * etc. Added regression tests for these bypass variants.

This comment was marked as resolved.

Sign in to view

tuhinkanti added 2 commits February 17, 2026 22:44

tuhinkanti merged commit 0272777 into main Feb 18, 2026
5 of 6 checks passed

tuhinkanti deleted the feature/code-execution-harness branch February 18, 2026 06:51

devin-ai-integration bot reviewed Feb 18, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add code execution harness with tool-use support#2

Add code execution harness with tool-use support#2
tuhinkanti merged 10 commits intomainfrom
feature/code-execution-harness

tuhinkanti commented Feb 18, 2026 •

edited by devin-ai-integration bot

Loading

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

Uh oh!

devin-ai-integration bot left a comment

Uh oh!

devin-ai-integration bot Feb 18, 2026

Uh oh!

devin-ai-integration bot Feb 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Comments

		Pattern.compile("\\bcurl\\s+.\\\|\\ssh"),
		Pattern.compile("\\bwget\\s+.\\\|\\ssh"),

Conversation

tuhinkanti commented Feb 18, 2026 • edited by devin-ai-integration bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

Uh oh!

devin-ai-integration bot left a comment

Choose a reason for hiding this comment

Uh oh!

devin-ai-integration bot Feb 18, 2026

Choose a reason for hiding this comment

Uh oh!

devin-ai-integration bot Feb 18, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Comments

tuhinkanti commented Feb 18, 2026 •

edited by devin-ai-integration bot

Loading